Comb filter decomposition for robust ASR
نویسندگان
چکیده
The harmonic structure of the voiced speech is an effective way of conveying information in a way that is robust to white Gaussian additive noise. In this paper we propose Comb Filter Decomposition (CFD), a new method for approximating the magnitude of the speech spectrum in terms of its harmonics, which first leads to a new interpretation of the normalized autocorrelation function. Then we introduce some feature extraction methods based on CFD and on standard autocorrelation, that emphasize the harmonic peaks of the speech spectrum. The results show an improved ASR performance under noisy conditions.
منابع مشابه
Robust Speech and Bird Song Processing using Multi-band Correlograms and Sparse Representations
of the Dissertation Robust Speech and Bird Song Processing using Multi-band Correlograms and Sparse Representations by Lee Ngee Tan Doctor of Philosophy in Electrical Engineering University of California, Los Angeles, 2014 Professor Abeer Alwan, Chair This dissertation focuses on algorithms for robust speech and bird song processing. Many applications perform well under ideal signal conditions,...
متن کاملImproving the performance of MFCC for Persian robust speech recognition
The Mel Frequency cepstral coefficients are the most widely used feature in speech recognition but they are very sensitive to noise. In this paper to achieve a satisfactorily performance in Automatic Speech Recognition (ASR) applications we introduce a noise robust new set of MFCC vector estimated through following steps. First, spectral mean normalization is a pre-processing which applies to t...
متن کاملPitch Estimation of Song Sounds Using Parallel-Connected Comb Filters and Singular Value Decomposition
A new pitch estimation method for song sounds using twelve comb filters and singular value decomposition (SVD) processing is presented. This method is based on the ability of a comb filter, corresponding to one tone, to eliminate the tone’s pitch and harmonic frequencies, and also on the relationship between the number of singular values obtained from the SVD and the number of the signal freque...
متن کاملEfficient Polyphase Decomposition of Comb Decimation Filters in Analog-to-digital Converters
H. Aboushady, Y. Dumonteix, M. M. Louërat and H. Mehrez Université Paris VI, Laboratoire LIP6/ASIM 4, Place Jussieu, 75252 Paris Cedex 05, France Email: [email protected] , [email protected] Abstract—A power efficient multi-rate multi-stage Comb decimation filter for mono-bit and multi-bit A/D converters is presented. Polyphase decomposition in all stages, with high decimation fa...
متن کاملNon-stationary signal processing and its application in speech recognition
The most widely used acoustic feature extraction methods of current automatic speech recognition (ASR) systems are based on the assumption of stationarity. In this paper we extensively evaluate a recently introduced filter stable, non-stationary signal processing method, which relies on an adaptive parttone decomposition of voiced speech to obtain alternative feature vectors for ASR. The non-st...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005